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STEROIDOGENIC FACTOR-1 PROTEIN VARIANTS 
AND METHODS OF MAKING SAME 

This application claims benefit of the filing 
date of U.S. Provisional Application No. 60/395,371, 
5 filed July 12, 2002, and which is incorporated herein by 
reference . 

This invention was made with government support 
under POl DK58390 from the National Institute of Health. 
The government has certain rights in this invention. 

10 BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION 

The present invention relates generally to the 
fields of nuclear receptors and protein expression and, 
more specifically, to mutants of steroidogenic factor-1 
15 and related monomeric nuclear receptors with improved 
properties . 

BACKGROUND INFORMATION 

The orphan nuclear receptor steroidogenic 
factor-1 (SF-1) is a monomeric binding nuclear receptor 
that is essential for endocrine organ proliferation, 
peptide hormone expression and for all steroid 
metabolism. Recombinant SF-1 protein has been expressed; 
in E. coli, SF-1 protein is soluble and expressed at 
reasonable levels. However, a variety of in vitro assays 
require high concentrations of purified protein. 
Unfortunately, SF-1 aggregates and is unsuitable for use 
at high concentrations in vitro. As an example. 
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relatively high concentrations of conf ormat ionally 
uniform protein are required to prepare crystals suitable 
for X-ray crystallography studies. Because wild type 
protein produces twinned crystals, X-ray structural 
5 analysis cannot be performed. Similarly, wild type 

protein is not monomeric at the concentrations needed for 
functional analysis of the receptor using in vitro 
peptide binding assays. In brief, any application that 
requires high levels of purified protein cannot be 

10 performed with the SF-1 protein preparations currently 

available. Thus, there is a need for novel forms of SF-1 
that exhibit increased stability of the monomeric form of 
. the receptor and that remain monomeric at high protein 
concentrations. The present invention satisfied this 

15 need and provides related advantages as well. 

SUmARY OF THE INVENTION 

The present invention provides a properly 
folded steroidogenic factor-l {SF-l)-like receptor 
variant, or active fragment thereof, which has an amino 

20 acid sequence that encodes a SF-l-like receptor or an 
active fragment thereof and that lacks at least one 
naturally occurring cysteine residue within the 
" ligand-binding domain of the receptor. Such a properly 
folded SF-1 receptor variant or active fragment thereof 

25 can exhibit increased monomer stability as compared to 

the analogous receptor that retains the cysteine residues 
lacking in the variant. In one embodiment, at least one 
naturally occurring cysteine residue within the 
ligand-binding domain of the SF-l-like receptor is 

30 substituted with a non-cysteine residue. 
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A SF-l-like receptor variant or active fragment 
of the invention can be, for example, a SF-1 receptor 
variant or a liver related homolog 1/f etoprotein 
transcription factor (LRH-l/FTF/SF-13) receptor variant, 
5 or .an active fragment thereof. An active fragment of the 
invention can include, for example,, the ligand-binding 
domain of a SF-1- like receptor. In one embodiment, the 
invention provides an active fragment containing a SF-1 
like receptor variant ligand-binding domain without 

10 additional sequence from the same receptor. In another 
embodiment, the invention provides an active fragment 
containing a ligand-binding domain and activation 
function 1 (AFl) of a SF-l-like receptor. In a further 
embodiment, the active fragment contains a ligand-binding 

15 domain and AFl without additional sequence from the same 
receptor. ^ 

A SF-l-like receptor variant or active fragment 
of the invention lacks at least one naturally occurring 
cysteine residue within the ligand-binding domain of the 

20 receptor. In one embodiment, the invention provides a 
SF-l-like receptor variant, or active fragment thereof, 
which lacks at least two naturally occurring cysteine 
residues " within the ligand-binding domain of the 
receptor. In another embodiment, the invention provides 

25 a SF-l-like receptor variant or active fragment thereof, 
in which at least two naturally occurring cysteine 
residues within the ligand-binding domain are substituted 
with non-cysteine residues. In a further embodiment, the 
invention provides a SF-l-like receptor variant, or 

30 active fragment thereof, which lacks at least three 
naturally occurring cysteine residues within the 
ligand-binding domain of the receptor. If desired, at 



least three naturally occurring cysteine residues within 
the ligand-binding domain of the SF-l-like receptor 
variant can be substituted with non-cysteine residues. 

A variety of amino acid substitutions can be 
5 useful in the SF-l-like receptor variants and active 
fragments of the invention. In particular embodiments, 
the ligand-binding domain of a SF^l-like receptor or 
active fragment thereof contains substitutions of at 
least one, two or three naturally occurring cysteine 

10 residues, where each of the cysteine residues is 
independently substituted with serine, threonine, 
alanine, valine, glycine, leucine, isoleucine, tyrosine, 
phenylalanine, tryptophan, methionine or histidine. In 
further embodiments, at least one, two or three naturally 

15 occurring cysteine residues each is independently 

substituted with serine or threonine. In still further 
embodiments, at least one, two or three naturally 
occurring cysteine residues each is independently 
substituted with serine. 

20 Further provided by the invention is a 

SF-l-like receptor variant, such as a SF-1 or LRH-1 
receptor variant, or an active fragment thereof, which 
contains amino acid substitutions at cysteine residues 
corresponding to C267, C3 02 and C423 of murine SF-1. In 

25 one embodiment, the invention provides a SF-1- receptor 
variant, or active fragment thereof, containing amino 
acid substitutions at cysteine residues corresponding to 
0261 , C3 02 and C423 of murine SF-1. Such a SF-1 receptor 
variant can be, for example, a human SF-1 receptor 

3 0 variant. In another embodiment, the invention provides a 
SF-1 receptor variant, or active fragment thereof, 
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containing serine siibstitutions at cysteine residues 
corresponding to C267, C302 and C423 of murine SF-1, In 
a further embodiment, the invention provides a human SF-1 
receptor variant, or active . fragment thereof, containing 
5 serine substitutions at cysteine residues corresponding 
to C267, C302 and C423 of murine SF-1. 

The invention also provides a SF-1 -like 
receptor variant, such as a SF-1 or LRH-l receptor 
variant, or an active fragment thereof, which contains 

10 amino acid substitutions at cysteine residues 

corresponding to C302 and G423 of murine SF-1. Such a 
SF-1 receptor variant, or active fragment thereof, can 
contain, for example, amino acid substitutions at 
cysteine residues corresponding to C302 and C423 of 

15 murine SF-1. In one embodiment, the invention provides a 
human SF-1 receptor variant, or active fragment thereof, 
containing amino acid substitutions at cysteine residues 
corresponding to C302 and C423 of murine SF-1. In 
another embodiment, the invention provides a SF-1 

20 receptor variant, or active fragment thereof, that 
contains serine substitutions at cysteine residues 
corresponding to C302 and C423 of murine SF-1. In yet a 
further embodiment, the invention provides a human SF-1 
receptor variant, or active fragment thereof, that 

25 contains serine substitutions at cysteine residues 
corresponding to C302 and C423 of murine SF-1. 

The present invention additionally provides a 
SF-l-like receptor variant, such as a SF-1 or LRH-1 
receptor variant, or an active fragment thereof, which 
30 contains amino acid substitutions at cysteine residues 
corresponding to C408 and C413 of murine SF-1. In one 
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embodiment, the invention provides a SF-1 receptor 
variant, or active fragment thereof, containing amino 
acid substitutions at cysteine residues corresponding to 
C408 and C413 of murine SF-1. In another embodiment, the 
5 invention provides a human SF-1 receptor variant, or 
active fragment thereof, containing amino acid 
substitutions at cysteine residues corresponding to C408 
and C413 of murine SF-1. In a further embodiment, the 
invention provides a SF-1 receptor variant, or active 
10 fragment thereof, containing serine substitutions at 
cysteine residues corresponding to C408 and C413 of 
murine SF-1. As a non- limiting example, such a SF-1 
receptor variant can be a human SF-1 receptor variant. 

Further provided herein is an LRH-1 receptor 

15 variant having amino acid substitutions at cysteine 

residues corresponding to C267 and C302 of murine SF-1. 
Such an LRH-1 receptor variant of can have, for example, 
serine substitutions at cysteine residues corresponding . 
to C267 and C302 of murine SF-1, and further can be, for 

20 example, a human LRH-1 receptor variant. The invention 
also provides an LRH-1 receptor variant having an amino 
acid substitution at a cysteine residue corresponding to 
C302 of murine SF-1. Such a substitution can be, without 
limitation, a serine substitution, and such an LRH-1 

25 receptor variant can be, for example, a human LRH-1 
receptor variant. Also provided herein is an LRH-1 
receptor variant having an amino acid siibstitution at a 
cysteine residue corresponding to C408 of murine SF-1. 
Amino acid substitutions useful in such an LRH-1 receptor 

30 variant include, but are not limited to, serine 

substitutions. A variety of LRH-1 receptor variants can 
have an amino acid substitution at a cysteine residue 
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corresponding to C408 of murine SF-1 including, for 
example, a human LRH-1 receptor variant. 



The present invention further provides a 
properly folded SF-1 receptor variant, or active fragment 
5 thereof; which has an amino acid sequence encoding a SF-1 
receptor or an active fragment thereof and that, lacks at 
least one of the following naturally occurring cysteine 
residues: a cysteine residue corresponding to C408 of 
murine SF-1 or a cysteine residue corresponding to C413 

10 of murine SF-1. Such a SF-1 receptor variant or active 
fragment of the invention can exhibit increased monomer 
stability as compared to an analogous receptor that 
retains all cysteine residues lacking in the variant. In 
one embodiment, the invention provides a SF-1 receptor 

15 variant or active fragment in which a cysteine residue 
corresponding to C408 or C413 of murine SF-1 is 
substituted with a non-cysteine residue. In another 
embodiment, the invention provides a SF-1 receptor 
variant or active fragment having an amino acid sequence 

20 that lacks naturally occurring cysteine residues 

corresponding to C4 08 and C413 of murine SF-1. Such a 
SF-1 receptor variant or active fragment can have, for 
example, amino acid substitutions at cysteine residues 
corresponding to C408 and C413 of murine SF-1. In a 

25 further embodiment, cysteine residues corresponding to 
C4 08 and C413 of murine SF-1 are independently 
substituted with serine, threonine, alanine or valine. 
In yet a further embodiment, cysteine residues 
corresponding to C4 08 and C413 of murine SF-1 are 

30 independently substituted with serine or threonine. In 
still a further embodiment, cysteine residues 
corresponding to C408 and C413 of murine SF-1 are 
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substituted with serine. Any of the above SF-l receptor 
active fragments can be, for example, an active fragment 
that includes a ligand-binding domain and further can be, 
for example, an active fragment that includes a 
5 ligand-binding domain without additional SF-1 receptor 
sequence . 

In another embodiment, the invention provides a 
human SF-1 receptor variant or active fragment thereof 
which has an amino acid sequence encoding a human SF-1 

10 receptor or an active, fragment thereof and that lacks a 
cysteine residue corresponding to C408 or C413 of murine 
SF-1. In a further embodiment, the invention provides a 
murine SF-1 receptor variant or active fragment thereof 
which has an amino acid sequence encoding a murine SF-1 

15 receptor or an active fragment thereof and that lacks a 
cysteine residue corresponding to C408 or C413 of murine 
SF-1. In yet a further embodiment, the invention 
provides a murine SF-1 receptor variant or active 
fragment thereof that includes the amino acid sequence 

20 SEQ ID NO: 19. 

Further provided herein is a properly folded 
SF-1 receptor variant or active fragment thereof that has 
an amino acid sequence which encodes a SF-1 receptor or 
an active fragment thereof and lacks at least one of the 

25 following naturally occurring cysteine residues: a 

cysteine residue corresponding to C267 of murine SF-1; 
a cysteine residue corresponding to C302 of murine SF-1; 
and a cysteine residue corresponding to C423 of murine 
SF-1. Such a SF-1 receptor variant or active fragment 

30 thereof can exhibit increased monomer stability as 
compared to an analogous receptor or fragment which 
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retains all cysteine residues lacking in the variant. In 
one embodiment, the invention provides a SF-1 receptor 
variant or active fragment in which at least one 
naturally occurring cysteine residue is substituted with 
5 a non-cysteine residue. In another embodiment-, the 
invention provides a SF-1 receptor variant or active 
fragment thereof having an amino acid sequence that lacks 
naturally occurring cysteine residues corresponding to 
C267 and C302 of murine SF-1. Such an amino acid 

10 sequence can have, for example, amino acid substitutions 
at cysteine residues corresponding to C267 and C302 of 
murine SF-1. In a further embodiment, the invention 
provides a SF-1 receptor variant or active "fragment 
thereof having an amino acid sequence that lacks 

15 naturally occurring cysteine residues corresponding to 
C267 and C423 of murine SF-1. Such an amino acid 
sequence can have, for example, amino acid substitutions 
at cysteine residues corresponding to C267 and C423 of 
murine SF-1. The invention also provides a SF-1 receptor 

20 variant or active fragment thereof that lacks naturally 
occurring cysteine residues corresponding to C302 and 
C423 of murine SF-1. Such a SF-1 receptor variant or 
active fragment can have, for example, amino acid 
substitutions at cysteine residues corresponding to C302 

25 and C423 of murine SF-1. In another embodiment, the 
invention provides a SF-1 receptor variant or active 
fragment thereof that lacks naturally occurring cysteine 
residues corresponding^ to corresponding to C267, C302 and 
C423 of murine SF-1. In such a variant or active 

30 fragment, useful amino acid sequences include, without 
limitation, those having amino acid substitutions at 
cysteine residues corresponding to C267, C302 and C423 of 
murine SF-1. In any of the above SF-1 receptor variants 
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or active fragments which include amino acid 
substitutions, each of the substituted cysteine residues 
can be independently substituted with, for example, 
serine, threonine, alanine or valine. In other 

5 embodiments, the invention provides any of the above SF-1 
receptor variants or active fragments in which each of 
the substituted cysteine residues is independently 
substituted with serine or threonine. In further 
embodiments, the invention provides any of the above SF-1 
10 receptor variants or active fragments in which each of 
the substituted cysteine residues is substituted with 
serine . 

An active fragment of the invention can 
include, for example, an SF-1 ligand-binding domain. In 

15 one embodiment, the invention provides a SF-1 active 
fragment containing SF-1 sequence limited to the 
ligand-binding domain. A variety of SF-1 receptors are 
useful in the invention including, without limitation, 
human and murine SF-1 receptors. In one embodiment, the 

20 invention provides a murine SF-1 receptor variant or 
active fragment thereof that includes the amino acid 
sequence SEQ ID NO: 15. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a schematic of amino acid 
25 sequences and functional domains of steroidogenic 

factor-1 (SF-1) conserved among various species. The 
human, bovine, rat and mouse SF-1 sequences are 
illustrated, with the classic Cys2-Cys2 zinc finger 
DNA-binding domain (DBD) , A box, hinge region, and ligand 
30 binding domain (LED) indicated. The two major activation 
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domains of SF-1, AFl and AF2, also are shown as well as 
homologous regions R-II and R-III. Percentage homologies 
are indicated along with key amino acid numbers. 

Figure 2 shows a sequence alignment of the SF7I 
hinge-ligand binding domain region with another member of 
the subclass V family of nuclear receptors, liver related 
homolog 1 (LRH-1) . Included in the alignment are the 
amino acid sequences of the hinge-ligand binding domain 
of murine (SEQ ID NO: 1) and human (SEQ ID NO: 2) SF-1; 
murine (SEQ ID NO: 3)- and human (SEQ ID NO: 4) LRH-1; and 
human retinoic X receptor a (RXRa; SEQ ID NO: 5) . The 
amino acid numbering is indicated at the beginning of 
each line. Predicted a helices (twelve) and 3-turns are 
indicated in bold and designated above the secpaences. 

Figure 3 shows the nucleotide sequence (SEQ ID 
NO: 6) and corresponding amino acid sequence (SEQ ID 
NO: 7) of murine SF-1 available from Genbank accession 
NM_139051 (version NM_139051.1; see, also, Ikeda et al., 
Mol. Endocrin. 7:825 (1993). 

Figure 4 shows the nucleotide sequence (SEQ ID 
NO: 8) and corresponding amino acid sequence (SEQ ID NO: 
9) of human SF-1 available from Genbank accession U76388 
(version U76388. 1) . 

Figure 5 shows the nucleotide sequence (SEQ ID 
NO: 10) and corresponding* amino acid sequence (SEQ ID 
NO: 11) of murine LRH-1 available from Genbank accession 
M81385 (version M81385.1) . 
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Figure 6 shows the nucleotide sequence (SEQ ID 
NO: 12) and corresponding amino acid sequence (SEQ ID 
NO: 13) of human LRH-1 available from Genbank accession 
NM_003822 (version NM_003822.2; see, also, Li et al . , J. 
5 Biol. Chem, 273:29022-29031 (1998)). 

Figure 7 shows the nucleotide and corresponding 
amino acid sequences (residues 219-462) of four SF-1 
variants each containing multiple cysteine to serine 
mutations. (A) The nucleotide and corresponding amino 

10 acid sequence of SF-1 Variant 1, which contains serine 

substitutions at cysteines C267, C3 02 and C423, are shown 
as SEQ ID NOS: 14 and 15, respectively. (B) The 
nucleotide and corresponding amino acid sequence of SF-1 
Variant 2, which contains serine substitutions at 

15 cysteines C302 and C423, are shown as SEQ ID NOS: 16 
and 17, respectively. (C) The nucleotide and 
corresponding amino acid sequence of SF-1 Variant 3, 
which contains serine substitutions at cysteines C408 and 
C413, are shown as SEQ ID NOS: 18 and 19, respectively. 

20 (D) The nucleotide and corresponding amino acid sequence 
of SF-1 Variant 4, which contains serine substitutions at 
cysteines C302, C408, C413 and C423, are shown as SEQ ID 
NOS: 20 and 21, respectively. 

Figure 8 shows a purification scheme for SF-1 
25 ligand-binding domain and characterization of fractions 
after various purification steps. (A) A flow chart of 
steps for purification of SF-1 ligand-binding domain. 
This protocol includes three chromatography steps: 
TALON™, ion exchange, and gel filtration chromatography. 
30 (B) Schematic of the histidine-tagged SF-1 (177-462) 
construct and removal of the 6XHIS-tag from the 
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SF-1 (177-462) protein by site-specific cleavage using the 
tobacco etch viral protease (TEV) . A schematic of the 
bound and flov? through fractions obtained using TALON"™ 
chromatography also is shown. (C) SDS-PAGE 
5 electrophoresis of the SF-1 (177-462) protein treated with 
increasing amounts of TEV protease. The 20kDa TEV 
protease can be seen as a faint protein band running 
slightly below cleaved SF-1 (177-462) protein (arrow). 
(D) Results obtained after the first TALON™ 

10 chromatography step (left panel) , TEV cleavage and second 
TALON™ chromatography step (middle panel) , and 
cent rifugat ion to remove aggregates and minor 
contaminants (right panel) . ''B" indicates bound 
fraction; "FT" indicates the flow through fraction; ''P" 

15 indicates pellet. 

Figure 9 shows the results of ion-exchange 
chromatography and gel filtration of SF-1 (177-462) . Left 
panel: A260 nM profile after cation-exchange 
chromatography (POROS-Q'') . Right panel: A260 nM profile 
20 following gel filtration showing both a minor dimeric and 
major monomeric SF-1 (177-462) species . Native 
polyacrylamide gel electrophoresis of the first peak 
(#1) , and the second, lower molecular weight peak (#2) 
also is shown. 

25 Figure 10 shows reduction of SF- 1 (177-462) 

dimers by treatment with iodoacetate. Top panel: Mass 
spectroscopy on a Voyager DE (MALDI-TOF) instrument of 
aged SF-1 (177-462) protein preparations. Lower panel: 
Mass spectroscopy of SF-1 (177-462) preparations 

30 iodoacetylated prior to storage for a similar time period 
as the aged preparations shown in the top panel. 
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Figure 11 shows a thermal denaturation curve 
for SF-1 Variant 3 generated using circular dichroism. 



. Figure 12 shows dynamic light scattering 
analysis (DLS) of wild type SF-1 and several SF-1 
5 variants. (A) DLS analysis of wild type SF-1 (219-462) • 
(B) DLS analysis of Variant 1. (C) DLS analysis of 
Variant 3 . 

Figure 13 shows NMR analysis of SF-l Variants 1 

and 3. 

10 Figure 14 shows specific coactivator binding by 

several SF-1 receptor variants. (A) SF-1 Variant 1 
binding to GRIP and SRC-like coactivator peptides. (B) 
Left panel: SF-1 Variant 3 binding to a GRIP coactivator 
peptide designated "LXXLL" and control peptide *'LXXAA." 

15 Right panel: SF-1 Variant 3 binding to SRC-like 

coactivator peptides designated ''LXXLL" and control 
peptide ^^LXXAA." 

DETAILED DESCRIPTION OF THE INVENTION 

Steroidogenic factor- 1 (SF-1) , also known as 
20 adrenal 4-binding protein (Ad4BP) and officially 

designated NR5A1, is an essential factor in adrenal and 
gonadal development and for the proper functioning of the 
hypothalamic -pi tuitary-gonadal axis. SF-1 is a 
transcription factor which activates the promoters of 
25 various adrenal /gonadal steroid hydroxylase genes, as 
well as a variety of genes essential for endocrine 
organogenesis (Ikeda et al . , Mol . Endocrinol . 7:852-860 
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(1993); Morohashi et al . , Mol . Endocrinol. 7:1196-1204 
(1993); and Parker and Schimmer, Endocr. Rev. 18:361-377 
(1997)). Mammalian SF-1 exhibits significant similarity 
to Drosophila fushi tarazu factor 1 (Ftz-Fl) , a regulator 
5 of the developmental homeobox gene fushi tarazu (Lavorgna 
et al.. Science 252:848-851 (1991); and Ueda et al . , 
Genes Dev. 4:624-635 (1990)). The mouse SF-1 gene 
therefore has been designated Ftz-Fl. SF-1 is conserved 
across both vertebrate and invertebrate species, 
10 indicating a conserved role for the protein in all 

metazoans (Honda et al.,- J. Biol. Chem. 268:7494-7502 
(1993); Lala et al . , Mol. Endocrinol. 6:124'9-1258 (1992); 
Nomura et al . , J. Biol. Chem. 270:7453-7461 (1995); Oba 
et al., Biochem . Biophvs . Res . Comm . 226:261-267 (1996); 
15 Sun et al . , Dev. Biol, 162:426-437 (1994); and Wong et 
al., J. Mol. Endocrinol. 17:139-147 (1996)). SF-1 
homologs have been cloned, for example, from silkworm, 
chicken and frog as well as a variety of mammalian 
species . 

SF-1 is a member of the steroid receptor 
superfamily, and all SF-1 homologs have a common 
structural organization that shares several features with 
other members of the steroid receptor superfamily. A 
classic Cys2-Cys2 zinc finger DNA-binding domain (DBD) is 
present in the amino-terminal region; this domain confers 
high affinity binding to the SF-1 cognate response 
element (see Figure 1) and is essential for DNA binding 
and subsequent transcriptional activation (and Wilson et 
al., Science 256:107-110 (1992); and Wilson et al . , Mol . 
Cell. Biol. 13:5794-5804 (1993)). The major nuclear 
import signal also maps to the tandem zinc finger domain. 
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In contrast to the majority of steroid 
receptors, which function as dimers in DNA-binding and 
transcriptional regulation, SF-1 binds DNA as a monomer 
at an extended AGGTCA site such as the perfect SF-1 
5 binding site, TCAAGGTCA (Wilson et al . , supra, 1993). In 
SF-1 and other monomeric nuclear receptors, amino acid 
residues carboxy- terminal to the DNA-binding domain, 
denoted the *'A" box, contribute to binding specificity by 
recognizing nucleotides 5' to the AGGTCA response 
10 element, resulting in an extended monomer response 

element with increased binding fidelity (Ueda et al., 
Mol. Cell. Biol. 12:5667-5672 (1992); Wilson et al . , 
supra, 1992; and Wilson et al . , supra, 1993). Such 
monomeric nuclear receptors include liver related 
15 homolog 1/f etbprotein transcription factor 

(LRH-l/FTF/SF-13). / nerve growth factor- induced gene-B 
(NGF-IB) , estrogen-related receptor 1 (ERRl) , 
estrogen-related receptor 2 (ERR2) and retinoic acid 
receptor-related orphan nuclear receptor (ROR) . 

A variety of genes bound and regulated by SF-1 
are known in the art. These SF-1 target genes include, 
for example, steroidogenic enzymes such as cytochrome 
P450 cholesterol side-chain cleavage enzyme (P450scc) and 
other steroidogenic targets such as the ACTH receptor; 
gonadal SF-1 target genes such as the gene for the 
male-specific Miillerian inhibiting substance (MIS) , which 
is expressed in the Sertoli cells of the testis and 
responsible for regression of the female specific 
Miillerian duct; and pituitary and hypothalamic target 
genes such as aGSU and the luteinizing hormone 3 subunit 
(LH3) . A variety of additional SF-1 target genes are 
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known in the art. See, for example, Hammer and Ingraham, 
Frontiers in Neurobiolocr/ 20:199-223 (1999). 



Like other members of the steroid receptor 
superfamily, SF-1 contains a conserved ligand-binding 
5 domain positioned at the carboxy- terminus of the receptor 
and a conserved activation function 2 (AF2) sequence in 
the carboxy- terminal region of the ligand-binding domain, 
(see Figure 1) . In many nuclear receptors, this domain 
confers responsiveness to specific ligands that activate 

10 or, in some cases, repress receptor transcriptional 

activity (Evans, Science 240:889-895 (1988); Forman et 
al., Nature 395:612-615 (1998)). While SF- 1 -dependent 
transcriptional activity has been shown in one instance 
to exhibit: a modest increase in response to 25-, 26-, and 

15 27-hydroxycholesterol in CV-1 cells (Lala et al . , Proc . 
Natl. Acad. Sci. USA 94:4895-4900 (1997)), a ligand for 
SF-1 has not been definitively identified, and SF-1 
consequently is referred to as an ^'orphan receptor." 

SF-1 has been shown- to have transact ivating 
2 0 activity in the absence of ''exogenous" ligand. Two 
regions have been identified as important for SF-1 
transact ivat ion. Point mutations within the conserved 
AF2 hexamer motif, LLIEML (SEQ ID NO: 23), which is 
critical for transactivation function of many nuclear 
25 receptors (Mangelsdorf et al . . , Cell 83:835-839 (1995)), 
abrogated SF-1 activity, as did removal of the distal 
hinge region that follows the DNA-binding domain. In 
contrast, much of the ligand-binding domain can be 
- truncated without significantly impairing SF-1 
30 transcriptional activity. Furthermore, in cell lines 



that support SF-1 -transcriptional activity, the AFl 
domain of SF-1 is const i tut ively phosphorylated on 
serine 203. A nonphosphorylatable mutant, SF-1®^"*, 
consistently exhibited a significant 50-80% reduction in 
5 transcriptional activity on the MIS promoter and other 
promoters as compared to wild- type SF-1 activity. Point 
mutations in the AF2 hexamer motif also resulted in 
significant reduction in SF-1 transact ivat ion, and a 
further reduction in activity was observed when the AF2 

10 hexamer mutation was combined with the S2 03A mutation 
(Hammer et al., Mol . Cell 3:521-526 (1999)). In sum, 
maximal SF-1 transcriptional activity requires both the 
AFl in the distal hinge domain and AF2 (Crawford et al . , 
Mol. Endocrinol. 11:1626-1635 (1997); Ito et al . , Mol. 

15 Cell. Biol. 17:1476-1483 (1997)). Two motifs in 

particular, the phosphoSer^^^ and LLIEML (SEQ ID NO: 23) 
hexamer of the AF2 domain, are essential for full SF-1 
transcriptional activity. 

Consistent with a role for SF-1 as a regulator 
20 of steroid hydroxylases, SF-1 is expressed in the primary 
organs . that produce steroid hormones, including adrenal 
cortical cells, testicular Leydig cells, and ovarian 
theca and granulosa cells (Ikeda et al . , Mol . 
Endocrinol . 8:654-662 (1994); Sasano et al . , J. Clin. 
25 Endocrinol. Metab. 80:2378-2380 (1995); Takayama et al . , 
J.. Clin. Endocrinol. Metab. 80:2815-2821 (1995)). SF-1 
also is expressed in the testicular Sertoli cell, the 
pituitary gonadotrope, and the ventral medial nucleus 
(VMN) of the hypothalamus (Asa et al . , J. Clin. 
30 Endocrinol. Metab. 81:2165-2170 (1996); Hatano et al . , 

Develop. 120:2787-2797 (1994); Ikeda et al . , supra, 1994; 
Ingraham et al . , Genes Dev. 8:2302-2312 (1994); Morohashi 
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et al., Mol. Endocrinol. 7:1196-1204 (1993); and Roselli 
et al., Brain Res. Mol, Brain Res. 44:66-72 (1997)). 
SF-1 transcripts have been detected in spleen and 
placenta in addition to the gonad, adrenal, pituitary and 
5 hypothalamus. 

In vivo signif ican'ce of SF-1 has been 
demonstrated in SF-1 knockout mice. Homozygous Ftz-Fl 
-/- mice all died of glucocorticoid and mineralocort icoid 
insufficiency (Luo et al., Mol. Endocrinol. 9:1233-1239 

10 (1995)). The absence of SF-1 resulted in female external 
genitalia regardless of chromosomal sex, consistent with, 
a role for SF-1 in gonadal formation and synthesis of 
androgens such as dihydrotestosterone, which is required 
for development of male external genitalia. Gonads and 

15 adrenal glands were completely absent from both sexes. 
Furthermore, all mice, regardless of chromosomal sex, 
displayed a female internal reproductive tract (Luo et 
al., Cell 77:481-490 (1994); and Sadovsky et al., Proc . 
Natl. Acad> Sci. USA 92:10939-10943 (1995)), consistent 

20 with a known role of SF-1 in regulation of Mullerian 
inhibiting substance (Giuili et al . , Development 
124:1799-1807 (1997); Shen et al . , Cell 77:651-661 
(1994) ) . In the absence of this inhibitory substance, 
regression of the Mullerian duct, the precursor of the 

25 vagina, uterus and fallopian tube, .does not take place. 
SF-1 null mice also lacked follicle stimulating hormone 
(FSH) and luteinizing hormone (LH) expression in the 
anterior pituitary. These results indicate that SF-1 is 
critical for appropriate development of the adrenals, 

30 gonads and pituitary gonadotropes . 
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The phenotype of the SF-1 null mice parallels 
the phenotype observed in the human syndrome of X- linked 
congenital hypoplasia, a disorder which is characterized 
by hypoplastic adrenal glands often accompanied by 
5 profound hypogonadism. The gene responsible for the 
human syndrome, DAX-1 (dosage- sensitive sex 
reversal-adrenal hypoplasia congenita critical region on 
the X chromosome) , localizes to Xp21 and, like deletions 
of SF-1, DAK-1 deletions result in profound adrenal 

10 hypoplasia in humans (Muscatelli et al . , Nature 372:672- 
676 (1994); and Zanaria et al., Nature 372:635-641 
(1994)). Dax-1 also is an orphan nuclear steroid 
receptor expressed in multiple endocrine organs; Dax-1 
and SF-1 appear to colocalize to cells of the adrenals, 

15 gonads, gonadotropes and VMN (Ikeda et al . , Mol . 

Endocrinol . 9:478-486 (1995); Swain et al . , Nat. Genetics 
12:404-409 (1996)). Together with the similar phenotypes 
of SF-1 null mice and Dax mutations in humans, these 
results reinforce the importance of SF-1 and indicate 

20 that SF-1 and Dax-1 can work together as essential 

regulators of the hypothalamic -pituitary- steroidogenesis 
axis in humans. 

As disclosed herein, freshly produced 
protease-cleaved SF-1 (177-462) was predominantly 

25 monomeric (Figure 9) but exhibited an equilibrium which 
favored the dimeric form over time. As shown herein in 
Figure 10, upper panel, mass spectroscopy demonstrated 
significant dimer formation in aged SF-1 (177-462) 
preparations. However, as further disclosed herein, 

30 SF-1 (177-462) protein preparations were iodoacetylated to 
covalently modify surface cysteines; monomer stability 
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was significantly enhanced in iodoacetylated SF-1 
preparations as shovm herein in Figure 10, lower panel . 

As additionally disclosed herein in 
Example III, mass spectroscopy of tryptic fragments of 
5 iodoacetylated murine SF-1, and three dimensional 

modeling of the SF-1 ligand binding domain were used to 
identify cysteines C267, C302, C408, C413 and C423 as 
potentially solvated residues available for 
intermolecular disulfide bond formation. Furthermore, 

10 two different SF-1 mutants containing multiple serine for 
cysteine substitutions were prepared; Variant 1, which 
contains residues 219-462 with serine substitutions at 
C267, C3 02 and C423, and Variant 3, which contains 
residues 219-462 with serine substitutions at C408 and 

15 C413 (see Example IV) . As shown herein in Figures 12 
and 13, these two murine SF-1 variants exhibited a 
significantly reduced tendency to dimerize relative to 
wild type murine SF-1. 

These results indicate that a variant of SF-1 
20 can exhibit increased monomer stability and reduced 

aggregation as compared to the analogous receptor that 
retains the cysteine residues lacking in the variant. 
Based on the high degree of conservation between the 
ligand-binding domains of SF-1 and LRH-1 illustrated, for 
25 example/ -in Figure 2, these results further indicate that 
variants of LRH-1 which are similarly modified to remove 
one or more potentially solvated cysteine residues also 
can be properly folded and can further exhibit increased 
monomer stability as compared to the analogous wild type 
30 LRH-1 receptor. Such SF-1 or LRH-1 variants can be 
useful, for example, in any functional or structural 
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studies that rely on relatively high concentrations of 
protein. As an example, . such variants can be useful in 
protein crystallization studies, which generally rely on 
protein preparations that are at least 80% monomeric. 

5 Thus, the present invention provides a properly 

folded SF-l-like receptor variant, or active fragment 
thereof, which has an amino acid sequence that encodes a 
SF-l-like receptor or an active fragment thereof and that 
lacks at least one naturally occurring cysteine residue 

10 within the ligarld-binding domain of the receptor. Such a 
properly folded SF-1 receptor variant or active fragment 
thereof can exhibit increased monomer stability as 
compared to the analogous receptor that retains the 
cysteine residues lacking in the variant. In one 

15 embodiment, at least one naturally occurring cysteine 

residue within the ligand-binding domain of the SF-l-like 
receptor is substituted with a non-cysteine residue. 

A SF-l-like receptor variant or active fragment 
of the invention can be, for example, a SF-1 receptor 

2 0 variant or a LRH-1 receptor variant, or an active 

fragment thereof.. An active fragment of the invention 
can include, for example, the ligand-binding domain. In 
one embodiment, the invention provides an active fragment 
containing a SF-l-like receptor variant ligand-binding 

25 domain without additional sequence from the same 

receptor. In another embodiment, the invention provides 
an active fragment containing a ligand-binding domain and 
activation function 1 (AFl) of a SF-l-like receptor. In 
a further embodiment, the active fragment contains a 

30 ligand-binding domain and AFl without additional sequence 
from the same receptor. 
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A SF-l-like receptor variant or active fragment 
of the invention lacks at least one naturally occurring 
cysteine residue within the ligand-binding domain of the 
receptor. In one embodiment, the invention provides a 
5 SF-l-like receptor variant, or active fragment thereof, 
which lacks at least two naturally occurring cysteine 
residues within the ligand-binding domain of the 
receptor. In another embodiment, the invention provides 
a SF-l-like receptor variant or active fragment thereof, 

10 in which at least two naturally occurring cysteine 

residues within the ligand-binding domain are substituted 
with non-cysteine residues. In a further embodiment, the 
invention provides a SF-l-like receptor variant, or 
active fragment thereof, which lacks at least three 

15 naturally occurring cysteine residues within the 

ligand-binding domain of the receptor. If desired, at 
least three naturally occurring cysteine residues within 
the ligand-binding domain of the SF-l-like receptor 
variant can be substituted with non-cysteine residues. 

20 The term ''variant," as used herein, means a 

SF-l-like receptor or other monomeric nuclear receptor, 
as described further below, having a non-naturally 
occurring amino acid sequence. A receptor variant of the 
invention has a ligand-binding domain in which one or 
25 more cysteine residues is deleted or substituted as 

compared to the amino acid sequence of the analogous wild 
type receptor. 

A SF-l-like receptor variant of the invention 
is properly folded. As used herein, the term ''properly 
30 folded" means a defined and homogeneous two- and 

three-dimensional structure. A properly folded receptor 
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variant generally has the same two- and three-dimensional 
structure that characterizes the analogous wild type 
receptor. Proper folding can be determined by any of a 
variety of routine techniques, including a well-defined 
5 melting point transition. Thermal denaturation curves 
can be generated, if desired, using circular dichroism as 
disclosed herein. 

In one embodiment, a SF-l-like receptor variant 
exhibits increased monomer stability as compared to the 

10 analogous receptor that retains the cysteine residues 
lacking in the variant. Monomers are proteins formed 
from a single polypeptide chain; dimers are formed from 
the association of two monomers. As used herein, the 
term "increased monomer stability" means that, in a 

15 purified protein preparation at high concentration, a 
higher percentage of monomeric protein is present as 
compared to the percentage of monomeric protein present 
in a purified protein preparation of analogous receptor 
that retains the cysteine residues lacking in the variant 

20 and is assayed under the same conditions. The term 

''analogous receptor," as used herein, means the naturally, 
occurring receptor having a sequence exhibiting the most 
similarity to the receptor variant. An analogous 
receptor may or may not have a wild type sequence and, 

25 furthermore, refers, as appropriate, to an active 

fragment of receptor. Thus, wherein one or more cysteine 
substitutions is made in a murine SF-1 fragment having 
residues 219 to 462, the analogous receptor is a fragment 
of wild type murine SF-1 having residues 219 to 462. It 

30 is understood that, other than lacking one or more 

naturally occurring cysteines, the receptor variant has 
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an identical sequence and is assayed under the same 
conditions as the '^analogous receptor." 

The dimerization observed with aged 
preparations of wild type SF-1 (177-462) , for example, in 
5 Figure 10, occurs at the relatively high protein 
concentrations used in many in vitro studies. In 
general; dimerization or other aggregation of wild type 
SF-1 ligand-binding domain generally is observed when 
purified SF-1 ligand-binding domain concentrations above 
10 30 |iM are used. Thus, it is understood that ''increased 
monomer stability" is assayed at relatively high protein 
concentrations of the receptor variant such as 
concentrations of at least 30 ]M, for example, 50 
to 100 UM. 

15 Methods for assaying increased monomer 

stability include, without limitation, dynamic light 
scattering (DLS) analysis. In DLS analysis, the 
diffusion constant of molecules moving randomly in 
solution is calculated from the autocorrelation function 

20 of the scattered light. The very small signal values 
generally are collected using a solid state photon 
counter (avalanche photo diode) and analyzed using 
autocorrelator electronics incorporating high-speed 
digital signal processors. From the diffusion constant, 

25 the hydrodynamic radius is calculated using the 
Stokes-Einstein equation. 

As further disclosed herein, a properly folded 
SF-l-like receptor variant or active fragment thereof can 
retain one or more functions of the analogous wild type 
30 receptor. Such functions include, without limitation, 



ligand-binding and co-activator binding. As a 
non- limiting example. Variant 3 functioned to 
specifically bind GRIP coactivator peptides as shown in 
Figure 14. These results indicate that SF-l-like 
5 receptor variants can retain one or more wild type 

functions despite having a non-naturally occurring amino 
acid sequence. 

As used herein, the term ''SF-l-like receptor" 
means a SF-1 receptor or LRH-1 receptor. An SF-l-like 

10 receptor is a protein having a ligand binding domain with 
at least about 55% identity to the ligand binding domain 
of murine SF-1 (residues 219 to 462 of SEQ ID NO: 1) or 
with at least about 55% identity to the ligand binding 
domain of murine LRH-1 (residues 296 to 542 of SEQ ID 

15 NO: 3) . An SF-l-like receptor can have, for'example, at 
least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 
99% identity to the ligand binding domain of murine SF-1 
(residues 219 to 462 of SEQ ID NO: 1) or at least 65%, 
70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% 

2 0 identity to the ligand binding domain of murine LRH-1 
(residues 296 to 542 of SEQ ID NO: 3) . 

The invention relates, in part, to a SF-1 
receptor variant or active fragment thereof. As used 
herein, the term ''SF-1" is synonymous with "steroidogenic 

25 factor-1" and means an SF-l-like receptor that is more 
closely related to murine SF-1 (SEQ ID NO: 1) than to 
murine LRH-1 (SEQ ID NO: 3) . An SF-1 receptor generally 
is a protein having a ligand binding domain with at least 
about 65% identity to the ligand binding domain of murine 

30 SF-1 (SEQ ID NO: 1) . The term SF-1 encompasses a variety 
of species homologs, including human, primate and other 
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mammalian homologs of SF-1 such as bovine, horse, pig, 
rat and murine SF-1 homologs as well as other vertebrate 
and invertebrate homologs. The amino acid and encoding 
nucleic acid sequences for such homologs are well known 
5 in the art. As non-limiting examples, a human SF-1 
coding sequence is available from Genbank accession 
U76388/ a bovine SF-1 coding sequence is available from 
Genbank accession D13569 (Honda et al . , J. Biol. Chem. 
268:7494-7502 (1993)); a horse SF-1 coding sequence is 

10 available from Genbank accession AF203911 (Boerboom et 
al.. Endocrinology 141: 4647-4656 (2000)).; a pig SF-1 
coding sequence is available from Genbank accession 
U84399 (Pilon et al . , Endocrinology 139:3803-3812 
(1998) ) ; and a murine SF-1 coding sequence is available 

15 from Genbank accession NM_139051.1. The coding sequences 
of additional SF-1 homologs also are well known in the 
art; examples include, but are not limited to, the coding 
sequence of chicken SF-1 available from Genbank accession 
AB002404 (Kudo and Sutou, Gene 197:261-268 (1997)) and 

2 0 the coding sequence of frog SF-1 available from Genbank 
accession AB017352.1 (Kawano et al . , Gene 222:169-176 
(1998)). The skilled person understands that the coding 
sequences of a variety of other SF-1 homologs are known 
in the art or can be identified by routine methods and 

25 that a variant or active fragment of the invention can be 
prepared from any such SF-1 homolog. 

The invention also relates, in part, to a LRH-1 
receptor variant or active fragment thereof. As used 
herein, the term "LRH-1" is synonymous with *'liver 
30 related homolog 1," '^fetoprotein transcription factor" 
and ''SF-iP" and means an SF-1- like receptor that is more 
closely related to murine LRH-1 (SEQ ID NO: 3) than to 
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murine SF-1 (SEQ ID NO: 1) . A LRH-1 receptor generally 
is a protein having a ligand binding domain with at least 
about 65% identity to the ligand binding domain of murine 
LRH-1 (SEQ ID NO: 3), shovm herein in Figure 2. The term 
5 LRH'l encompasses a variety of species homologs, 

including human, primate and other mammalian homologs of 
LRH-1 such as bovine, horse, pig, rat and murine LRH-1 
homologs as well as other vertebrate and invertebrate 
homologs. The amino acid and encoding nucleic acid 

10 sequences for such homologs are well known in the art. 
As non-limiting examples, a human LRH-1 coding sequence 
is available from Genbank accession NM_003822.2 (Li et 
al., J. Biol. Chem. 273:29022-29031 (1998)); a rat LRH-1 
coding sequence is available from Genbank accession 

15 NM_021742; a murine LRH-1 coding sequence is available 

from Genbank accession M81385; and a chicken LRH-1 coding 
sequence is available from Genbank accession AB002403. 
The skilled person understands that the coding sequences 
of a variety of other LRH-1 homologs are known in the art 

2 0 and that routine molecular methods can be used to 

identify additional homologs. A LRH-1 receptor variant 
or active fragment of the invention can be prepared from 
any such homolog. v> 

A SF-l-like receptor variant or other monomeric 
2 5 nuclear receptor variant or active fragment of the 

invention has an amino acid sequence which lacks at least 
one naturally occurring cysteine residue within the 
ligand-binding domain. Such a receptor variant or active 
fragment of the invention also can have one or more 
30 additional amino acid sequence alterations as compared to 
the wild type receptor sequence. Thus, in addition to 
lacking one or more naturally occurring cysteine residues 
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within the ligand-binding domain, a variant or active 
fragment of the invention can have one or more additions, 
deletions or substitutions that confer, for example, 
altered in vitro or in vivo stability, solubility, 
5 bioavailability, ligand binding activity or coactivator 
binding activity or other favorable characteristic or 
that facilitate, for example, expression, purification or 
detection of the variant or active fragment thereof. 
Such a modification can be deliberate, as through 

10 site-directed mutagenesis, or can be accidental, as 

through mutation in hosts harboring an encoding nucleic 
acid, and can be located within the receptor 
ligand-binding domain or outside of this domain. All 
such modified receptors and active fragments are included 

15 in the definition of a receptor variant as long as the 
variant or active fragment has an amino acid sequence 
which lacks at least one naturally occurring cysteine 
residue within the ligand-binding domain and as long as 
the variant or active fragment retains at least one 

20 biological activity of the analogous wild type receptor. 
Such a biological activity can be, for example, ligand 
binding; coactivator binding such as GRIP binding; or 
transact ivat ion. 



Thus, the term ''SF-l-like receptor variant" 
25 encompasses polypeptides having an amino acid sequence 
which, other than lacking one or more cysteine residues 
within the ligand-binding domain, is identical to the 
analogous wild type receptor sequence, as well as 
polypeptides having a similar, non- identical sequence. 
30 Minor modifications of murine or human SF-1 (SEQ ID NO: 7 
or 9) that do not destroy transact ivat ion activity or the 
ability to associate with a coactivator such as GRIP fall 
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within the definition of a SF-1 receptor variant when 
combined with the deletion or substitution of at least 
one naturally occurring cysteine residue within the 
ligand-binding domain. Similarly, minor modifications of 
5 LRH-1 that do not destroy trans activation activity or the 
ability to associate with a coactivator such as GRIP fall 
within the definition of an LRH-1 receptor variant when 
combined with the deletion or siibstitution of at least 
one naturally occurring cysteine residue within the 
10 ligand-binding domain. 

It further is understood that various moieties 
can be attached to a SF-1 receptor variant, LRH-1 
receptor variant or another SF-1 -like receptor variant, 
monomer ic receptor variant or active fragment of the 

15 invention. As an example, a variety of heterologous tags 
are well known in the art and readily available and can 
be included. in a receptor variant or active fragment of 
the invention. Such tags include epitope tags and 
encompass, without limitation, the V5 tag, the FLAG tag 

20 DYKDDDDK (SEQ ID NO: 24) ; the HA tag YPYDVPDYA (SEQ ID 
NO: 25); the c-Myc epitope EQKLISEEDL (SEQ ID NO: 26); 
the AUl tag DTYRYI (SEQ ID NO: 27); and the 6-HIS tag. 
One skilled in the art understands that these and other 
heterologous tags can be fused to a receptor variant or 

25 active fragment thereof to conveniently purify or detect 
the fused protein. A receptor variant, or active 
fragment of the invention also can be fused, for example, 
to a heterologous DNA-binding domain, or heterologous 
activation domain, in place of the native DNA-binding or 

30 activation domain to form a chimeric receptor variant. 
Additional moieties that can be attached to a receptor 
variant include heterologous polypeptides or polypeptide 



domains, peptides, carbohydrates, lipids, radionuclides 
and fluorescent and other chemical moieties. These 
fusion polypeptides or polypeptide conjugates also are 
encompassed by the invention. 

5 Also provided herein is a monomeric nuclear 

receptor, or active fragment thereof, containing an amino 
acid sequence that encodes a monomeric nuclear receptor 
or an active fragment thereof and lacks at least one 
naturally occurring cysteine residue within the 
10 ligand-binding domain of the receptor.- 

A nuclear receptor can be identified through 
primary, secondary or tertiary structural homology to at 
least one ligand-binding domain of a known nuclear 
hormone receptor such as a retinoid receptor, 

15 glucocorticoid receptor, estrogen receptor, progesterone 
receptor, thyroid receptor, vitamin D receptor, 
mineralocorticoid receptor or another member of the 
steroid hormone receptor superfamily such as those 
described in Whitfield et al . , J. Cell. Biochem. 75 

20 Suppl . 32:110-122 (1999). A nuclear receptor generally 
has a tertiary structure which is a sandwich of 11 to 13 
a-helices and several small [S-strands organized around a 
lipophilic binding cavity (Williams and Sigler, Nature 
393:392-396 (1998)). The ligand-binding domain of a 

25 nuclear receptor generally contains three subregions : a 
conserved El domain; the heptad 9 (h9) subregion; and an 
AF2 subregion. Such a nuclear receptor can be 
recognized, in part, by conserved lysine, phenylalanine 
and aspartic acid-glutamine residues in the El subregion 

30 (Whitfield, supra, 1999) . 
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As used herein, the term ''monomeric nuclear 
receptor" means a nuclear receptor that specifically 
binds DNA as a monomer with high affinity. Such a 
monomeric nuclear receptor generally binds to a cognate 
5 binding site with an affinity of at least about 100 nM 
(Shen et al . , Cell 77: 651-661 (1994)). Examples of 
monomeric nuclear receptors include but are not limited 
to SF-1, LRH-1, NGF-IB, ERRl , ERR2 , ROR, Rev-ErbA and 
Rev-ErbB and other nuclear receptors with similar 

10 properties (Lazar and Harding, ''Monomeric Nuclear 
Receptors" in Freedman (Ed.), Molecular Biology of 
Steroid and Nuclear Hormone Receptors Boston: Birkhauser 
pp. 261-279 (1998)). In one embodiment, the invention 
provides a monomeric nuclear receptor, or active fragment 

15 thereof, other than ROR. 

It is clear that a variety of cysteine 
substitutions can be useful in the SF-l-like receptor 
variants and active fragments of the invention. In 
particular embodiments, the ligand-binding domain of a 

20 SF-l-like receptor or active fragment thereof contains 
substitutions of at least one, two or three naturally 
occurring cysteine residues, where each of the cysteine 
residues is independently substituted with serine, 
threonine, alanine, valine, glycine, leucine, isoleucine, 

2 5 tyrosine, phenylalanine, tryptophan, methionine or 

histidine. In further embodiments, at least one, two or 
three naturally occurring cysteine residues each is 
independently substituted with serine or threonine. In 
still further embodiments, at least one, two or three 

30 naturally occurring cysteine residues is substituted with 
serine . 
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The invention also provides a SF-l-like 
receptor variant, such as a SF-1 or LRH-1 receptor 
variant, or an active fragment thereof, which contains 
amino acid substitutions at cysteine residues 
5 corresponding to C267, C302 and C423 of murine SF-1. In 
one embodiment, the invention provides a SF-1 receptor 
variant, or active fragment thereof, containing amino 
acid substitutions at cysteine residues corresponding to 
C267, C302 and C423 of murine SF-1. Such a SF-1 receptor 

10 variant can be, for example, a human SF-1 receptor 

variant. In another embodiment, the invention provides a 
SF-1 receptor variant, or active fragment thereof, 
containing serine substitutions at cysteine residues 
corresponding to C267, C302 and C423 of murine SF-1. In 

15 a further embodiment, the invention provides a human SF-1 
receptor variant, or active fragment thereof, containing 
serine substitutions at cysteine residues corresponding 
to C267, C302 and C423 of murine SF-1. 

The invention further provides a SF-l-like 
20 receptor variant, such as a SF-1 or LRH-1 receptor 

variant, or an active fragment thereof, which contains 
amino acid substitutions at cysteine residues 
corresponding to C302 and C423 of murine SF-1. In one 
embodiment, the invention provides a SF-1 receptor 
25 variant, or active fragment thereof, containing amino 

acid substitutions at cysteine residues corresponding to 
C302 a"nd C423 of murine SF-1. In another embodiment, the 
invention provides a human SF-1 receptor variant, or 
active fragment thereof, containing amino acid 
30 substitutions at cysteine residues corresponding to C302 
and C423 of murine SF-1. In another embodiment, the 
invention provides a SF-1 receptor variant, or active 



34 

fragment thereof, that contains serine substitutions at 
cysteine residues corresponding to C302 and C423 of 
murine SF-1. In yet a further embodiment, the invention 
provides a human SF-1 receptor variant, or active 
5 fragment thereof, that contains serine substitutions at 
cysteine residues corresponding to C302 and C423 of 
murine SF-1. 



The invention additionally provides a SF-1 -like 
receptor variant, such as a SF-1 or LRH-1 receptor 

10 variant, or an active fragment thereof, which contains 
amino acid substitutions at cysteine residues 
corresponding to C4 08 and C413 of murine SF-1. In one 
embodiment, the invention provides a SF-1 receptor 
variant, or active fragment thereof, containing amino 

15 acid substitutions at cysteine residues corresponding to 
C4 08 and C413 of murine SF-1. In another embodiment, the 
invention provides a human SF-1 receptor variant, or 
active fragment thereof, containing amino acid 
substitutions at cysteine residues corresponding to C408 

20 and C413 of murine SF-1. In a further embodiment, the 
invention provides a. SF-1 receptor variant, or active 
fragment thereof, containing serine substitutions at 
.cysteine residues corresponding to C408 and C413 of 
murine SF-1. As a non- limiting example, such a SF-1 

25 receptor variant can be a human SF-1 receptor variant. 

In the receptor variants and active fragments 
of the invention, one or more cysteines can be 
substituted with alternative amino acids. It is 
understood that a naturally occurring cysteine residue 
3 0 can be substituted with any amino acid; a naturally 
occurring cysteine residue can be substituted, if 
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desired, with an amino acid of similar size and polarity. 
Cysteine is most similar in size and polarity to serine, 
and also is similar in size and polarity to threonine, 
alanine and valine. 

5 It is understood that any number of naturally 

occurring cysteines within the ligand-binding domain of 
SF-1, LRH-1 or another SF-l-like receptor or monomeric 
nuclear receptor can be depleted by deletion or 
substitution to produce a receptor variant or active 

10 fragment of the invention. A SF-l-like receptor variant 
or active fragment of the invention can lack, for 
example, a single naturally occurring cysteine, or can 
lack two or more, three or more, four or more, five or 
more, six or more, or seven or more cysteines that occur 

15 naturally within the ligand-binding domain, for example, 
two, three, four, five, six, seven or eight cysteines 
that occur naturally within the ligand-binding domain. 
Furthermore, where two or more naturally occurring 
cysteines are depleted from the receptor or active 

20 fragment thereof, they can be depleted in any 

combination. It also is understood that, where multiple 
naturally occurring cysteines are depleted, both deletion 
and substitution can be used to remove different 
naturally occurring cysteine residues in the same 

25 receptor variant or active fragment thereof. 

As can be seen in Figure 2, eight cysteine 
residues occur naturally in the ligand-binding domain of 
a SF-1 receptor; these cysteines correspond to C24 8, 
C267, C284, C302, C371, C408, C413 and C423 in murine 
30 SF-1. Thus, a SF-1 variant or active fragment thereof 
can lack one or more cysteines corresponding to C248, 
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C267, C284, C302, C371, C408, C413 or C423 or can lack, 
for example, any combination of two or more, three or 
more, four or more, five or more, six or more, or seven 
or more cysteines corresponding to C248, C267, C284, 
5 C3 02, C371, C4 08, C413 or C423. In one embodiment, a 
SF-1 variant or active fragment thereof lacks a single 
naturally occurring cysteine within the ligand-binding 
domain, where this single cysteine is C248, C267; C284, 
C302, C371, C408, C413 or C423. 



10 As disclosed herein, solvated cysteines that 

can participate in intermolecular disulfide bond 
formation were identified by determining the cysteines 
most highly reactive with iodoacetamide and by modeling 
the three-dimensional model of the murine SF-1 

15 ligand-binding domain using the crystal structure of 
liganded human RXRa (Egea et al . , EMBO J. 19:2592-2601 
(2000)). In particular, C267, C302 and C423 were 
identified by reactivity with iodoacetamide, and C4 08 
and C413 were . identified by three-dimensional modeling. 

20' Thus, a SF-1 receptor variant or active fragment of the 
invention can have, for example, a substitution at a 
* single naturally occurring cysteine residue within the 
ligand-binding domain, where the single substituted 
cysteine residue corresponds to C267, C302, C408, C413 or 

25 C423 in murine SF-1. A SF-1 receptor variant or active 
fragment of the invention also can contain, for example, 
substitutions at two naturally occurring cysteine 
residues within the ligand-binding domain, where the two 
cysteine residues correspond to C267 and C302, C267 and 

30 C408, C267 and C413, C267 and C423, C302 and C408, C302 

and C413, C302 and C423, C408 and C413, C408 and C423, or 
C413 and C423, in murine SF-1. As discussed further 



below, a variety of residues, including, but not limited 
to, serine, threonine, alanine, and valine, can be 
independently substituted for naturally occurring 
cysteine residues . 

5 Of the potentially solvated cysteines in the 

SF-1 ligand-binding domain, four cysteines, those 
corresponding to C267, C302, C371 and C408 in murine 
SF-1, are present in the LRH-1 receptor. Thus, a LRH-1 
receptor variant or active fragment of the invention can 

10 have, for example, a substitution at a single naturally 
occurring cysteine residue within the ligand-binding 
domain, where the single cysteine residue corresponds to 
C267, C302, C371 or C408 in murine SF-1. A. LRH-1 
receptor variant or active fragment of the invention also 

15 can have, for example, substitutions for two naturally 
occurring cysteine residues within the ligand-binding 
domain, where the two cysteine residues correspond to 
C267 and C302, C267 and C371, C267 and C408, C302 and 
C371, C302 and C408, or C371 and C408, in murine SF-1. 

20 It is understood that any of a variety of amino acid 

substitutions can be useful in a LRH-1 variant or active 
fragment of the invention, including, without limitation, 
serine, threonine, alanine, valine and the other neutral 
amino acids described hereinbelow. 

25 In one embodiment, the invention provides a 

SF-1, LRH-1 or other SF-1- like receptor variant having a 
ligand-binding domain in which at least one, two or three 
naturally occurring cysteines is independently 
substituted with serine, threonine, alanine, valine, 

30 glycine, leucine, isoleucine, tyrosine, phenylalanine, 
tryptophan, methionine or histidine. In a further 
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embodiment, each of the s\ibstituted cysteines in the 
receptor variant or active fragment thereof is 
independently substituted with serine, threonine, alanine 
or valine. In yet a further embodiment, each of the 
5 substituted cysteines in the receptor variant or active 
fragment thereof is independently substituted with serine 
or threonine. In another embodiment, each of the 
substituted cysteines in the receptor variant or active 
fragment thereof is substituted with serine. 

10 As used herein in reference to the substitution 

of two or more naturally occurring cysteine within a 
ligand-binding domain, the term ''independently" means 
that the amino acids selected to substitute for the 
naturally occurring cysteines can be the same or 

15 different. As an example, the invention encompasses a 
SF-l-like receptor variant, or active fragment thereof, 
which has a serine for cysteine substitution at the 
cysteine corresponding to C267, an alanine for cysteine 
substitution at the cysteine corresponding to C302, and a 

20 valine for cysteine substitution at the cysteine 

substitution corresponding to C423. Similarly, the 
invention encompasses a SF-l-like receptor variant, or 
active fragment thereof, which has a threonine for 
cysteine substitution at the cysteine corresponding to 

25 C302 and serine for cysteine substitution at the cysteine 
corresponding to C423, or which has serine for cysteine 
substitutions at both C408 and C413. 

It is understood that, in different species 
homologs and isotypes and in different receptors, the 
30 residue numbering of a particular cysteine may vary; 
specific cysteine residues are defined herein with 
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reference to the murine SF-1 amino acid sequence shown in 
Figure 2. As used herein, the term "corresponding to" 
means the cysteine residue in a different receptor or 
species homolog that occupies, relative to the reference 
5 cysteine, the analogous position in the primary, 

secondary or tertiary structure of the receptor. Such a 
"corresponding" cysteine residue can be readily 
identified by routine methods, for example, alignment of 
primary sequences or assignment of conserved secondary 

10 structures • As an example, cysteine 267 in murine SF-1 
corresponds to cysteine 266 in human SF-1, to 
cysteine 365 in murine LRH-1, and to cysteine 346 in 
human LRH-1. A cysteine corresponding to C267 also can 
be identified, for example, as a cysteine positioned 

15 within helix 3. Similarly, a cysteine corresponding to 
C302 can be identified by its position between helices 4 
and 5; a cysteine corresponding to C408 can be identified 
by its position at the C- terminus of helix 9; a cysteine 
corresponding to C413 can be identified by its position 

20 between helices 9 and 10; and a cysteine corresponding to 
C423 can be identified by its position between helices 10 
and 11. 

The present invention further provides a 
nucleic acid molecule which encodes a SF-1 -like receptor 

25 variant, or active fragment thereof, having an amino acid 
sequence lacking at least one naturally occurring 
cysteine residue within the ligand-binding domain. The 
invention further provides vectors and related host cells 
that contain a nucleic acid molecule encoding a SF-1 -like 

30 receptor variant or active fragment of the invention. In 
one embodiment, the vector is a bacterial expression 
vector . 
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A nucleic acid molecule of the invention can 
encode any of the SF-l-like receptor variants or active 
fragments of the invention, including receptor variants 
and active fragments that exhibit increased monomer 
5 stability at high concentration. A nucleic acid molecule 
of the invention can encode, for example, a SF-1 receptor 
variant or active fragment thereof, or a LRH-1 receptor 
variant or active fragment thereof. The active fragment 
can include the ligand- binding domain, with or without 

10 additional receptor sequence, or can contain the 

ligand-binding domain and AFl, with or without additional 
sequence. It is understood, therefore, that an active 
fragment can be a larger fragment encompassing the ligand 
binding domain, or encompassing both the ligand-binding 

15 domain and AFl. 

In particular embodiments, a nucleic aicid 
molecule of the invention encodes a SF-l-like receptor 
variant or active fragment thereof which lacks at least 
one, two or three cysteine residues within the 

20 ligand-binding domain of the receptor. In additional 
embodiments, the invention provides a nucleic acid 
molecule encoding a SF-l-like receptor variant or active 
fragment thereof, in which at least one, two, or three 
cysteine residues within the ligand-binding domain are 

25 substituted with non-cysteine residues. In a further 
embodiment, the one or more substituted cysteines are 
independently substituted with serine, threonine, 
alanine, valine, . glycine, leucine, isoleucine, tyrosine, 
phenylalanine, tryptophan, methionine or histidine. In 

30 another embodiment, the one or more substituted cysteines 
are independently substituted with serine or threonine. 
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Further provided by the invention is a nucleic 
acid molecule encoding a SF-1 or LRH-1 receptor variant, 
or an active fragment thereof, which contains amino acid 
substitutions at cysteine residues corresponding to C267, 
5 C302 and C423 of murine SF-1. Such a nucleic acid 

molecule can have, for example, serine substitutions at 
the cysteine residues corresponding to C267, C302 
and C423 of murine SF-1. The invention also provides a 
nucleic acid molecule encoding a SF-1 or LRH-1 receptor 

10 variant, or an active fragment thereof, which contains 
amino acid substitutions at cysteine residues 
corresponding to C302 and C423 of murine SF-1. In such a 
nucleic acid molecule, the cysteine residues 
corresponding to C302 and C423 of murine SF-1 can be 

15 substituted, for example, with serine. The invention 

further provides a nucleic acid molecule encoding a SF-1 
or LRH-1 receptor variant, or an active fragment thereof, 
which contains amino acid substitutions at cysteine 
residues corresponding to C408 and C413 of murine SF-1. 

20 In one embodiment, the nucleic acid molecule encodes a 

SF-1 or LRH-1 receptor variant or active fragment thereof 
containing serine substitutions at cysteine residues 
corresponding to C408 and C413 of murine SF-1. 

As used herein, the term "nucleic acid 
25 molecule" means any polymer of two or more nucleotides, 
which are linked by a covalent bond such as a 
phosphodiester bond, a thioester bond, a phosphorothioate 
bond or any of various other bonds known in the art as 
useful and effective for linking nucleotides. Such 
30 nucleic acid molecules can be linear, circular or 

supercoiled, single-stranded or double -stranded, D- or 
L-sterioisomers, and can be, for example, DNA or RNA, or 
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a DNA/RNA hybrid. Nucleic acid molecules of the 
invention encompass those with modified linkages or 
containing one or more nucleoside analogs including 
non-naturally occurring RNA and DNA analogs; such nucleic 
5 acid molecules can exhibit resistance to enzymatic or 
chemical degradation . 

The invention also provides vectors which 
contain a nucleic acid molecule encoding a SF-l-like 
variant or active fragment thereof having an amino acid 

10 sequence lacking at least one naturally occurring 

cysteine residue within the ligand-binding domain of the 
receptor, wherfe the SF-l-like receptor variant or active 
fragment is properly folded upon expression. Such 
vectors, which can be cloning vectors or expression 

15 vectors, provide a means to transfer an exogenous nucleic 
acid molecule into a prokaryotic or eukaryotic host cell. 
Contemplated vectors include vectors derived from 
bacteria or a combination of bacterial and viral 
sequences, such as plasmids and cosmids, and those 

20 derived from a virus, such as a bacteriophage, 

baculovirus a retrovirus. The vectors of the invention 
can advantageously be used to clone or express a SF-1 
receptor variant, LRH-1 receptor variant, or other 
receptor variant or active fragment of the invention. 

25 Various vectors and methods for introducing such vectors 
into a host cell are described, for example, in Ausubel 
et al . , supra, 2000. 



In addition to a nucleic acid molecule encoding 
a SF-l-like receptor variant or active fragment thereof, 
30 a vector of the invention also can contain, if desired, 
one or more of the following non-limiting elements: an 
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oligonucleotide encoding, for example, a transcription or 
translation regulatory element or a termination codon; 
one or more selectable marker genes, such as an 
ampicillin, kanamycin, tetracycline, neomycin, hygromycin 
5 or zeomycin resistance gene, which is useful for 

selecting transf ormants or stable transf ectants ; one or 
more enhancer or promoter sequences, which can be 
obtained, for example, from a bacterial, viral or 
mammalian gene; transcription termination and RNA 

10 processing signals, which are obtained from a gene or a 
virus such as SV40; an origin of replication such as an 
SV40, polyoma or E. coli origin of replication; versatile 
multiple cloning sites; and one or more RNA promoters 
such as a T7 or SP6 promoter, which allows for in vitro 

15 transcription of sense and antisense RNA. 

In one embodiment, a vector of the invention is 
an expression vector. Expression vectors are well known 
in the art and provide a means to transfer and express an 
exogenous nucleic acid molecule in a host cell. 

20 Contemplated expression vectors include vectors that 

provide for expression in a host cell such as a bacterial 
cell, yeast cell, insect cell, frog cell, mammalian cell 
or other animal cell. Such expression vectors include 
regulatory elements specifically required for expression 

25 of the DNA in a cell, the elements being located relative 
to the nucleic acid molecule encoding the SF-l-like 
receptor variant or active fragment so as to permit 
expression thereof. The regulatory elements can be 
chosen to provide constitutive expression or, if desired, 

30 inducible or cell type-specific expression. Regulatory 
elements required for expression include transcription 
and translation start sites and termination sites and are 
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well known in the art. Such sites permit binding, for 
example, of RNA polymerase and ribosome subunits. A 
bacterial expression vector can include, for example, an 
RNA transcription promoter such as the lac promoter, a 
5 Shine -Delgarno sequence and an initiator AUG codon in the 
proper frame to allow translation of an amino acid 
sequence. The expression vector pBH (Hoffman-La Roche) 
is one of many expression vectors suitable for high level 
expression in bacteria and is specifically encompassed by 
10 the invention. 

Any of a variety of routine techniques can be 
used to prepare a nucleic acid molecule encoding a 
receptor variant or active fragment of the invention 
based on the known sequence of wild type receptor. Wild 
15 type SF-1 and LRH-1 sequences are shown in Figures 3 

through 6 and described hereinabove and the sequences of 
additional SF-1- like receptors and other monomeric 
nuclear receptors are well known in the art or can be 
obtained by routine methods. 

20 A variety of routine molecular methods can be 

used to prepare a nucleic acid molecule encoding a 
SF-1 -like receptor variant or active fragment of the 
invention. As a non- limiting example, 

oligonucleotide-directed mutagenesis can be useful in 
25 preparing the compositions of the invention. 

Oligonucleotide-directed mutagenesis utilizes an 
oligonucleotide primer encoding the desired deletion or 
substitution to prime DNA synthesis on a single -stranded 
uracil -containing template. After polymerization, for 
30 example, with T4 polymerase and addition of T4 DNA 
ligase, wild type bacteria are transformed, and DNA 
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encoding the variant receptor isolated (see, for example, 
Smith, Ann . Rev . Genet . 19:423-463 (1985); Ausiibel et 
al . , Current Protocols in Molecular Biology Chapter 8 
John Wiley & Sons, Inc. (2000)). 

5 A nucleic acid molecule encoding a SF-l-li)ce 

receptor variant such as a SF-1 or LRH-1 variant, or 
another monomeric nuclear receptor variant or active 
fragment of the invention also can be conveniently 
generated from a wild type sequence using the polymerase 

10 chain reaction (PCR) . A synthetic oligonucleotide can be 
designed, for example, to incorporate a point mutation at 
one end of an amplified fragment. Following PCR 
amplification, the amplified fragment can be made 
blunt-ended using Klenow fragment, and then ligated and 

15 subcloned into a convenient vector for sequence analysis. 
Alternatively, where convenient restriction sites flank 
the region encoding the cysteine residue to be mutated, 
two oligonucleotides encompassing the mutation are 
annealed and extended with mutually primed synthesis. In 

20 a second PCR step, the desired fragment is amplified 
using outside primers, digested with the appropriate 
restriction enzymes and subcloned into the corresponding 
segment of wild type receptor. The resulting plasmid 
contains a fragment identical to the original wild type 

25 receptor sequence, except for the point mutation 

introduced by the pair of overlapping primers. These and 
related PCR procedures are routine and well known in the 
art of molecular biology. See, for example, Ausubel, 
supra, 2000. 
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A nucleic acid molecule encoding a SF-l-like 
receptor variant such as a SF-1 or LRH-1 receptor variant 
or another monomeric nuclear receptor variant or active 
fragment of the invention also can be assembled from 
5 mutually priming long oligonucleotides. Such a strategy- 
is particularly useful for generating nucleic acids up to 
about 4 00 bp in length. The appropriate oligonucleotides 
are designed, including at least one oligonucleotide 
encoding the desired deletion or substitution of a 

10 naturally occurring cysteine, and typically have a length 
of more than 100 nucleotides. It is understood that due 
to degeneracy of the genetic code, codons that do not 
occur in the native sequence can be present in the 
synthetic sequence encoding the receptor variant. The 

15 two oligonucleotides are designed with complementary 

sequence at their 3* end to form a short duplex segment 
that allows priming by the polymerase. Two or more of 
such pairs of oligonucleotides can be designed, if 
desired, with restriction endonuclease sites at the end 

20 to join adjacent fragments and for subcloning into a 
desired vector (Ausubel et al . , supra, 2000). The 
skilled person understands that these and other 
procedures routine in the art of recombinant DNA 
technology can be used to prepare the nucleic acid 

25 molecules of the invention. 

The invention also provides a host cell 
containing an expression vector that includes a nucleic 
acid molecule encoding a SF-l-like receptor variant or an 
active fragment thereof, where the SF-l-like receptor 
30 variant or active fragment is properly folded upon 

expression. Such a host cell can be used to express the 
encoded receptor variant or active fragment thereof; 
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recombinant SF-l-like receptor variant can be 
substantially purified from the host cells using well 
known biochemical procedures (see, for example. Example I 
and Ausubel, supra, 2000) . As disclosed herein in 
5 Example V, a SF-l-like receptor variant or active 
fragment of the invention is properly folded upon 
expression and can have one or more biological activities 
of the analogous wild type receptor. 

Non-limiting examples of host cells useful in 
10 the invention include bacterial, yeast and frog cells, 
and mammalian and other animal cells. Various bacterial 
cells suitable for high level expression of exogenous 
proteins are well known in the art and include, without 
limitation, E. coli strains such as BL21(DE3) pLysS 
15 (Novagen, Inc.; Madison, WI) . Various mammalian cells 
useful as host cells encompass, yet are not limited to, 
murine NIH/3T3 cells, Chinese hamster ovary (CHO) cells, 
COS cells and HeLa cells. Methods for introducing a 
vector into a host cell include electroporation, 
20 microinjection, calcium phosphate or DEAE-dextran 

transf ection, lipofection and other methods well known in 
the art . 

Further provided by the invention is a method 
of preparing an isolated properly, folded SF-l-like 

25 receptor variant, or active fragment thereof, by 

expressing in a host cell an expression vector containing 
a nucleic acid molecule encoding a SF-l-like receptor 
variant or active fragment thereof having an amino acid 
sequence lacking at least one naturally occurring 

30 cysteine residue within the ligand-binding domain of the 
receptor, and isolating said SF-l-like receptor variant. 
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or active fragment thereof, from the host cell. In one 
embodiment, the host cell is a bacterial host cell such 
as an E. coli BL21(DE3) pLysS cell. In another 
embodiment, the SF-l-like receptor or active fragment 
5 thereof contains a heterologous epitope tag, which can 
facilitate purification. Such an epitope tag can be, 
without limitation, .a 6-histidine- tag, or any of the 
epitope tags described hereinabove or known in the art. 
Methods for purification of the resultant hormone 
10 receptors are described herein in Example I and also are 
well known in the art as described, for example, in 
Deutscher, Methods in Enzvmoloav , Vol. 182, "Guide to 
Protein Purification," San Diego: Academic Press, Inc. 
(1990) . 

15 The following examples are intended to 

illustrate but not limit the present invention. 

EXAMPLE I 

NATIVE SF-1 LIGAND-BINDING DOMAIN FORMS DIMERS 
AT HIGH CONCENTRATIONS 

20 This examples describes purification and 

characterization of the AFl/LBD domain of SF-1. 

A SF-1 fragment encoding .activation domain 1 
and the complete ligand-binding domain (residues 177-462) 
was expressed in bacteria and purified using the scheme 
25 shown in Figure 8A. The purification procedure included 
three chromatography steps: TALON™, ion exchange, and gel 
filtration chromatography as described further below. 
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The SF-1 protein construct encoding 
residues 177-462 was expressed using vector pBH in 
E. coli BL21(DE3) pLysS (Novagen, Inc.), Cultures were 
grown to 0.3 O.D. at 37°C and subsequently transferred to 
5 room temperature, where the cultures were induced 

with 0.2 mM ITPG . After growth at 30^C for five hours, 
cultures were harvested and frozen at -70 °C. 

Bacterial pellets were lysed in lysis buffer 
(50 mM TRIS pH 8.0, 100 mM NaCl, 5% glycerol) containing 

10 protease inhibitors. After addition of lysozyme and 
lysis by three cycles of f reeze-thawing in liquid 
nitrogen, cell lysates were treated with DNase A to 
reduce viscosity. The concentration of KCl was then 
adjusted to 0.3 M, and lysates were centrifuged at 15,000 

15 rpm in a GS 34 Sorvall rotor for 3 0 minutes at 4^*0. The 
soluble fraction of His-tagged SF-1 (about 80% of total 
protein expressed) was purified by mixing lysates with 
TALON™ beads (BD Biosciences Clontech; Palo Alto, CA) 
inside a Biorad ReadyPack column (Biorad Laboratories; 

20 Hercules, CA) : The columns were incubated with rotation 
for 30 minutes at 4''C and subsequently washed with lysis 
buffer supplemented by 10 mM imidazole, 2 mM ATP, and 10 
mM MgCls- His-tagged protein was eluted with an 
imidazole gradient (80 mM imidazole) , and was typically 

25 more than 90% pure. 

The 6XHIS-tag was removed from SF-1 (177-462) by 
site-specific cleavage using the tobacco etch viral 
protease (TEV) . The TEV recognition and cleavage site is 
30 ENLYFQ/G(TCS) (SEQ ID NO: 22) and was engineered just 5' 
to the BamHI cloning site and downstream of the linker 
and a 6-histidine tag (see Figure 8B) . His-tagged SF-1 
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protein was treated with TEV protease at a ratio of 1:200 
SF-1:TEV at 30 °C for 2 hours in a dialysis cassette in 
TEV cleavage buffer (200 mM NaCl, 100 mM TRIS pH 7.5, 20 
TtiM 3-mercaptoethanol) . As shown in Figure 8C, lower 
5 panel, incubation of the HIS- tagged SF-1 protein with 
increasing amounts of protease resulted in cleavage of 
the HIS-tag from the SF-1 fusion protein. 

Dialyzed material was reapplied to TALON™, and 
the bound and flow through fractions analyzed. The 2 0kDa 
TEV protease was observed as a faint band running 
slightly below cleaved SF-1 protein (arrow) . SDS-PAGE 
analysis of the bound (B) and flow through (FT) fractions 
showed retention of the TEV protease (Figure 8D, middle 
panel, black arrow) and the uncleaved HIS-TEV-SF-1 
protein (Figure 8D, middle panel, open arrow) as well as 
some non-specific retention of cleaved SF-1. Prior to 
subsequent chromatography, the flow through fraction 
(cleaved SF-1 protein) was concentrated and centrifuged 
at 100,000 g for 30 minutes ■ to remove some minor 
contaminants and large SF-1 (177-462) aggregates 
(Figure 8D, middle panel, pellet ^^P" ) . 

To reduce dimerization due to intermolecular 
sulfhydryl bridge formation, surface cysteines were 
covalently modified. The SF-1 fraction was treated 
25 overnight with 20 mM iodoacetate in 50 mM Tris pH 8.5; 

50 mM NaCl; and 5 mM EDTA. Following this reaction, the 
SF-1 (177-462) protein was subject to cat ion -exchange 
chromatography to remove minor contaminants present after 
the TEV cleavage and centrif ugation steps and to quench 
30 the iodoacetylation reaction. SF-1 (177-462) was eluted 
in a 150 mM NaCl step after a shallow gradient of 0-80 mM 
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NaCl was used to separate contaminating protein from 
SF-1 (177-462) . Figure 9, left panel, shows the A260 nM 
profile obtained after the cat ion- exchange (POROS-Q*^) 
chromatography. Two contaminating peaks flanked the 
5 major SF- 1 (177-462 ) protein peak. 

SF-1 dimers formed during the purification 
process were separated from the monomeric species with 
two back-to-back Superdex 200 gel filtrations. The 
results of the final Superdex 200 gel filtrations are 

10 shown. in Figure 9, right panel. The A260 nM profile 
shows both a minor dimeric and major monomeric 
SF-1 (177-462) species. Native polyacrylamide gel 
electrophoresis confirmed the presence of dimers in the 
first peak (#1) , and the presence of a single monomeric 

15 in the second, lower molecular weight peak (#2) . These 
results indicate that, despite the presence of large 
amounts of monomeric protein, SF-1 monomer is in 
equilibrium with higher aggregate forms. 

The yield of protein obtained with the 

20 purification procedure was greater than 50%. From 1 
liter of transformed bacteria starting material, 
about 15 mg purified SF-1 (177-462) were obtained. 
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EXAMPLE II 

REDUCTION OF SF- 1 ( 177 -4 62 ) DIMERS BY COVALENT 
MODIFICATION OF CYSTEINE RESIDUES 

This example demonstrates that covalent 
modification of cysteine residues can diminish 
SF-1 (177-462) dimer formation. 

Electrophoresis on native polyacrylamide gels 
indicated that SF-1 (177-462) preparations contained 
significant amounts of dimer, trimer or larger molecular 
weight complexes (see Figure 9, right panel) . 

Mass spectroscopy of aged SF-1 (177-462 ) protein 
preparations on a Voyager DE (MALDI-TOF) instrument 
confirmed the presence of SF-1 (177-462) dimers, as shown 
in Figure 10, top panel. To determine if the amount of 
aggregated material can be altered by covalent 
modification of cysteine residues, the SF-1 (177-462) 
preparation was treated with 2 0 mM iodoacetate overnight, 
thereby adding a carboxymethyl group to the reactive 
moieties that can form disulfhydryl bridges when 
unmodified. As shown by mass spectroscopy of 
iodoacetylated SF-1 (177-462 ) , the amount of dimer was 
reduced by about 6- fold following iodoacetylation (see 
Figure 10, lower panel) . These results indicate that 
reduction of SF-1 (177-462) preparations by 
carboxymethylation or covalent modification of cysteine 
residues can be used to diminish formation of SF-1 
aggregates and increase the stability of the monomeric 
form. 
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EXAMPLE III 

IDENTIFICATION OF POTENTIALLY SOLVATED CYSTEINES IN THE 
MtTRINE SF-1 LIGAND-BINDING DOMAIN 

Potentially solvated cysteines within the SF-1 
5 ligand-binding domain were identified by reactivity with 
iodoacetamide essentially as follows. Briefly, wild type 
SF-1 in storage buffer was diluted 1:10 in 50 mM Tris 
pH 8.5, 50 mM NaCl and 5 mM EDTA before iodoacetate was 
added to a final concentration of 20 mM and the mixture 

10 allowed to react. Following iodoacetylation, protein was 
treated with trypsin; mass spectrometry of the resulting 
tryptic fragments identified peptide sequences containing 
the most highly reactive cysteines. These peptides were 
compared to the known murine SF-1 sequence to assign 

15 cysteines 267, 302 and 423 as the most highly reactive 
cysteines present in SF-1. 

Three dimensional modeling of the murine SF-1 
ligand-binding domain was performed using the WHAT IF 
program (Vriend, J, Mol , Graphics 8:52-56 (1990)) based 

20 on the structure of human RXRa, where sequences for 

murine SF-1 were aligned against the sequence of the PDB 
entry of the crystal structure of the liganded LBD, 
chain A (Egea et al . , EMBO J, 19:2592-2601 (2000)). 
Models were constructed as described previously in 

25 Vriend, supra, 1990. From the three dimensional model of 
the murine SF-1 ligand-binding domain, amino acids 408 
and 413 were identified as additional potentially 
solvated cysteine residues. 
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In sum, these results identify cysteines C267, 
C302, C408, C413 and C423 as potentially solvated and 
available for intermolecular disulfide bond formation.. 

EXAMPLE IV 

PREPARATION OF SF-1 LIGAND- BINDING DOMAIN VARIANTS 
CONTAINING SERINE FOR CYSTEINE SUBSTITUTIONS 

This example describes preparation of 
SF-1 (219-462) variants containing two,, three or four 
cysteine to serine substitutions within the 
ligand-binding domain. 

Constructs were designed to contain 
substitutions at two or more cysteines based on the 
identification of potentially solvated cysteines 
described above. As for the wild type SF-1 (219-462 ) 
construct, each variant was cloned into BamHI and Xhol 
sites and contained a 6-histidine linker separated from 
residues 219 to 462 of SF-1 by a TEV protease site. Each 
of the SF-1 variants was designed to have a Gly-Ser 
amino-terminal sequence. The nucleic acid sequences of 
the SF-1 coding portions of Variant 1 (SEQ ID NO: 14), 
Variant 2 (SEQ ID NO: 16), Variant 3 (SEQ ID NO: 18) and 
Variant 4 (SEQ ID NO: 20) are shown in Figure 7. 

Constructs were prepared by Kunkel mutagenesis 
according to the method described in Kunkel et al., Meth. 
Enzvmol . 154: 367 (1987), using the primers shown in 
Table 1. 
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Table 1 


Primer 


Mutatio 
n 


Seguence/SEQ ID NO: 


A, reverse 


C267S 


5 ' -TCGGCCATTCGCGAGAGGAGGCTG-3 » t 
(SEQ ID NO: 28) 


B, reverse 


C302S 


5 • -CTCGCTCCAAGAGTTTTGCAGCAGTG-3 
1 

(SEQ ID NO: 29) 


C, reverse 


C423S 


5 ' -CACCTCCACCAGCGACAATAGCAAC-S ' 
(SEQ ID NO: 30) 


D, forward 


C408S, 
C413S 


5 ' -TACACCTTGAGCCACTACCCACACTCCG 
GGGACAAATTCC-3 ' (SEQ ID NO: 31) 


BamHI , 
forward 




5 ' -CTATCCAGAGGGATCCTCAGGAGG-3 ' 
* 

(SEQ ID NO: 32) 


Xhol , reverse 




5 ' -AGGAGTCTTCTCGAGGCAGTGGCA-3 ' 
(SEQ ID NO: 33) 


t Nucleotide substitutions underlined and in bold 
* Restriction sites indicated in bold 



To create Variant 1, Primer A (SEQ ID NO: 28) , 
Primer B (SEQ ID NO: 2 9), and Primer C (SEQ ID NO: 30) 
were used to produce serine substitutions at C267, C302 
15 and C423, generating the mutations in Variant 1. 

Primer D (SEQ ID NO: 31) was used to introduce serine 
substitutions at cysteines C413 and C423, producing the 
mutations in Variant 3. To create Variant 4, Variant 3 
was used as the template for mutagenesis with Primers B 
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(SEQ ID NO: 29) and C (SEQ ID NO: 30), thereby adding 
serine substitutions at C302S and C423S. 

Using the three different variant templates 
' produced above, the region spanning murine SF-1 amino 
5 acids 219 to 462 was amplified using forward BamHI primer 
SEQ ID NO: 32 and reverse Xhol primer SEQ ID NO: 33. The 
amplified segments containing newly created 5' BamHI and 
3' Xhol sites were ligated into pBH4, which is a 
modification of the vector pET19b that contains a SxHis 
10 tag and a cleavage site for the tobacco etch virus (TEV) 
protease (Hillier et al . , Science 284: 812-815 (1999),). 

The SF-1 variants were purified as described 
above in Example I with omission of iodoacetylation 
steps . 

15 The following yields were obtained. 





Table 2 




Purification of wild type SF-1 and receptor variants 


Construct 


% Yield 


% Purity 


Wild -type SF-1 


about 50% 


99% (aggregated) 


Variant 1 


about 50% 


99% 


Variant 3 


about 50% 


99% 


Variant 4 


Not available* 


Not available* 


* Expressed protein was insoluble 
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EXAMPLE V 

CHARACTERIZATION OF SF- 1 ( 2 1 9 > 4 62 ) VARIANTS CONTAINING 



SERINE FOR CYSTEINE SUBSTITUTIONS 



This example demonstrates that cysteine to 
5 serine substitutions within the SF-1 receptor 

ligand-binding domain can be used to produce properly 
folded receptor variants that exhibit increased monomer 
stability at high concentration as compared to the 
stability of the wild type SF-1 AFl/ligand-binding 
10 domain. 

Thermal denaturation of SF-1 (219-462) variant 
proteins was measured by monitoring circular dichroism 
(CD) spectra at 222 nm using a Jasco model J-720 
spectrophotometer with Peltier temperature controller and 

15 1 mm path cuvettes. Purified proteins (10 ]iM) were 
equilibrated in CD buffer (20 mM TRIS pH 8.0, 150 mM 
NaCl, 10% glycerol, 3 mM CHAPS, 2 mM TCEP) and 
centrifuged prior to all measurements. Profiles were 
determined by increasing the temperature from 2 0''C 

20 to go^'C with a gradient of 30^C per hour (an acquisition 
time of 4 seconds) with a data pitch interval of 0.2°C. 
Midpoints of the thermal transitions were estimated using 
the first derivations of the thermal denaturation curves. 

As shown in Figure 11, SF-1 Variant 3 displayed 
25 a single melting transition point, indicative of a 
properly folded protein. The melting point of the 
protein was approximately 48 °C. These results 
demonstrate that substitution of cysteines within the 
ligand-binding domain of SF-1 does not greatly perturb 
30 the structure of the ligand-binding domain and that this 
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strategy can be used to prepare properly folded SF-1 and 
other receptor variants . 

Dynamic light scattering analysis was performed 
to determine whether SF-1 variant protein was 
monodispersed (monomeric) and also to measure the radius 
of the protein. Samples were dialyzed and assayed 
at 50-120 \xM in the following buffer: 20 mM HEPES, pH 
7.4; 100 mM NaC; 1 mM EDTA; 1 mM CHAPS; and 10 mM DTT as 
described in Brown (Ed.), Dynamic Light Scattering: The 
Method and Some Applications Oxford: Clarendon Press 
(1993) . 

As shown in Figure 12B, preparations of SF-1 
Variant 1 {C267S/C302S/C423S) were about 80% monomeric. 
Analysis of .SF-1 Variant 3 (C408S/C413S; SEQ ID NO: 19) 
revealed that this mutant protein was more than 91.4% 
monomeric, with a stokes radius of 2.34 nm and a size of 
25 kDa (see Figure 12C) . Unlike SF-1 variants 1 and 3, 
the fraction of wild type SF-1 (219-462) was too low for 
accurate quantitation, as shown in Figure 12A. In 
contrast to wild type SF-1 (219-462) , relatively little 
aggregated protein was observed for either variant. 

These results indicate that SF-1 Variants 1 
and 3 display a single stable monomeric species in 
contrast to the scatter observed with wild type SF-1 
ligand-binding domain. 

SF-1 Variants 1 and 3 were subject to nuclear 
magnetic resonance (NMR) analysis. As shown in 
Figure 13, Variant 3 showed less aggregation, greater 
solubility (100 ]M) , and greater intensity and definition 
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of peaks than SF-1 Variant 1. Both cysteine variants 
also displayed significantly less aggregation than wild 
type protein. These data indicate that SF-1 Variants 1 
and 3 have greater overall stability of monomeric 
5 configuration than the same portion of unmodified (wild 
type) SF-1. 



Fluorescent peptides were used in equilibrium 
binding studies with SF-1 Variant 3 to determine if the 
variant was functional for coactivator binding. 

10 Coactivator peptides tested in the assay were selected 
from a set of 40 known nuclear receptor coactivators and 
corepressors and included all NR interaction boxes from 
GRIPl, SRC, PGC, NcoR, Smrt , Trap, Prip and Ripl40. 
Control coactivator peptides harbored mutations of 

15 Leucine to Alanine in positions 4 and 5 of the LXXLL 
motif (LXXAA) . Binding was performed essentially as 
described in Geist linger and Guy, J . Am . Chem . Soc . 
123:1525-1526 (2001), in the following binding buffer: 
20 mM Tris HCl pH 8.0, 100 mM NaCl, 10% glycerol, 

20 1 mM DTT, 0.01% NP-40 and 1 mM EDTA. Binding data with 
different peptides and their controls were obtained; the 
results with GRIPl and SRC peptides and SF-1 Variant 3 
are illustrated. 



As shown in the Figure 14, SF-1 Variant 3 bound 
25 specifically to GRIP peptides; this binding was not 
saturable up to 100 uM of protein. As expected, SRC 
coactivator peptides did not display specific binding to 
SF-1 Variant 3. In contrast, wild type SF-1 (219-462 ) 
bound neither GRIP nor SRC coactivator peptides, 
30 consistent with the fact that the wild type ligand 



binding domain was present as inactive dimers or other 
aggregates . 

All journal article, reference and patent 
citations provided above, in parentheses or otherwise, 
whether previously stated or not, are incorporated herein 
by reference in their entirety. 

Although the invention has been described with 
reference to the examples provided above, it should be 
understood that various modifications can be made without 
departing from the spirit of the invention. Accordingly, 
the invention is limited only by the claims. 



